On nonstationary hidden Markov modeling of speech signals
نویسندگان
چکیده
We propese an exact maximum likelihood (ML) approach for hidden Markov modeling of speech signals using models with mixtures of Gaussian autoregressive (AR) output probability distributions. This approach differs from the commonly used approach in two aspects. First, the parameters of the AR models are calculated using the exact, rather than the asymptotic, form of the likelihood function. Second, the gain of each AR model as weil as its shape is estimated and used during the recognition phase. Since the asymptotic likelihood is appropriate only for sources which are stationary in some sense, the ML approach taken here can be considered as an approach for nonstationary modeling. The proposed approach was tested on the task of recognizing isolated versions of the Englieh alphabet spoken by four different speakers by a system which was simultaneously trained for the four talkers ( multi-speaker recognizer). This approach results in a recognition accuracy which is comparable to that obtained by the asymptotic ML approach.
منابع مشابه
Estimation of nonstationary hidden Markov models by MCMC sampling
Hidden Markov models are very important for analysis of signals and systems. In the past two decades they have been attracting the attention of the speech processing community, and recently they have become the favorite models of biologists. Major weakness of conventional hidden Markov models is their inflexibility in modeling state duration. In this paper, we analyze nonstationary hidden Marko...
متن کاملAn MCMC sampling approach to estimation of nonstationary hidden Markov models
Hidden Markov models (HMMs) represent a very important tool for analysis of signals and systems. In the past two decades, HMMs have attracted the attention of various research communities, including the ones in statistics, engineering, and mathematics. Their extensive use in signal processing and, in particular, speech processing is well documented. A major weakness of conventional HMMs is thei...
متن کاملHierarchical Classification Tree Modeling of Nonstationary Noise for Robust Speech Recognition
Noise robustness is a key issue in successful deployment of automatic speech recognition systems in demanding environments such as hospital operating rooms. Perhaps the most successful way to overcome the additive noise obstacle is to employ a model adaptation scheme built around a set of dedicated clean speech and noise-only statistical models. Existing recognizer designs generally rely on rel...
متن کاملNonstationary-state hidden Markov model representation of speech signals for speech enhancement
A novel formulation of the nonstationary-state hidden Markov model (NS-HMM), employed as the speech model and serving as the theoretical basis for the construction of a speech enhancement system, is presented in this paper. The NS-HMM is used as a compact, parametric model, generalized from the stationary-state HMM, for describing clean speech statistics in the construction of the minimum mean-...
متن کاملSpeech enhancement based on hidden Markov model using sparse code shrinkage
This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...
متن کامل